Robust Fuzzy Classification Maximum Likelihood Clustering with Multivariate t-Distributions
نویسندگان
چکیده
Mixtures of distributions have been used as probability models for clustering data. Classification maximum likelihood (CML) procedure is a popular mixture of maximum likelihood approach to clustering. Yang (1993) extended CML to fuzzy CML (FCML) for a normal mixture model, called FCML-N. However, normal distributions are not robust for outliers. In general, t-distributions should be more robust to outliers than normal distributions. In this paper we consider FCML with multivariate t-distributions and then create a robust clustering algorithm, called FCML-T. To compare with the expectation & maximization for multivariate t-distributions (EM-T), the proposed FCML-T uses a much simpler equation for solving the degrees of freedom. Some numerical and real experimental examples are used to compare the FCML-T with FCML-N, EM for normal mixtures (EM-N) and EM-T. The results demonstrate the superiority and usefulness of the proposed FCML-T algorithm.
منابع مشابه
Estimation of parameters in latent class models using fuzzy clustering algorithms
A mixture approach to clustering is an important technique in cluster analysis. A mixture of multivariate multinomial distributions is usually used to analyze categorical data with latent class model. The parameter estimation is an important step for a mixture distribution. Described here are four approaches to estimating the parameters of a mixture of multivariate multinomial distributions. Th...
متن کاملRobust mixture clustering using Pearson type VII distribution
A mixture of Student t-distributions (MoT) has been widely used to model multivariate data sets with atypical observations, or outliers for robust clustering. In this paper, we developed a novel robust clustering approach bymodeling the data sets usingmixture of Pearson type VII distributions (MoP). An EM algorithm is developed for the maximum likelihood estimation of the model parameters. An o...
متن کاملA robust Fuzzy Classification Maximum likelihood Clustering Framework
In 1993, Yang first extended the classification maximum likelihood (CML) to a so-called fuzzy CML, by combining fuzzy c-partitions with the CML function. Fuzzy c-partitions are generally an extension of hard c-partitions. It was claimed that this was more robust. However, the fuzzy CML still lacks some robustness as a clustering algorithm, such as its inability to detect different volumes of cl...
متن کاملA Novel Multimodal Probability Model for Cluster Analysis
Cluster analysis is a tool for data analysis. It is a method for finding clusters of a data set with most similarity in the same group and most dissimilarity between different groups. In general, there are two ways, mixture distributions and classification maximum likelihood method, to use probability models for cluster analysis. However, the corresponding probability distributions to most clus...
متن کاملA fuzzy k-partitions model for categorical data and its comparison to the GoM model
The grade of membership (GoM) model uses fuzzy sets as memberships of each individual to extreme profiles (or classes) on the likelihood function of multivariate multinomial distributions. The GoM clustering algorithm derived from the GoM model is used in cluster analysis for categorical data, but it is iterated with complicated calculations. In this paper we create another approach, termed a f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014